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Abstract 

■ This work is concerned with autoregressive prediction of turning points in fi- 

nancial price sequences. Such turning points are critical local extrema points 

O: 

along a series, which mark the start of new swings. Predicting the future time 
of such turning points or even their early or late identification slightly before 
^ ■ or after the fact has useful applications in economics and finance. Building 

on recently proposed neural network model for turning point prediction, we 
^ ■ propose and study a new autoregressive model for predicting turning points 

of small swings. Our method relies on a known turning point indicator, a 
\ Fourier enriched representation of price histories, and support vector regres- 

sion. We empirically examine the performance of the proposed method over 
a long history of the Dow Jones Industrial average. Our study shows that the 
proposed method is superior to the previous neural network model, in terms 
OA ! of trading performance of a simple trading application and also exhibits a 

quantifiable advantage over the buy-and-hold benchmark. 
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1. Introduction 

We focus on the difficult task of predicting turning points in financial 
price sequences. Such turning points are special in the sense, that they reflect 
instantaneous equilibrium of demand and supply, after which a reversal in 
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the intensity of these quantities takes place. These reversals can often result 
from random events, in which case they cannot be predicted. The basis 
for the current work is the hypothesis that numerous reversal instances are 
caused by partially predictable dynamics generated by market participants. 
We are not concerned here in deciphering this dynamics and extracting its 
mathematical laws, but merely focus on the question of how well such turning 
points can be predicted in an autoregressive manner. 

Turning points can be categorized according to their "size", which is 
reflected by the duration and magnitude of the trends before and after the 
reversal. Long term reversals are often termed business cycles. A well known 
type of a long term reversal is the Kondratiev wave (also called supercycle) , 
whose duration is between 40 to 60 years [if. Such waves, as well as shorter 
term business cycles, are extensively studied in the economics literature. 
However, our focus here is on much shorter trend reversals whose magnitude 
is of order of a few percents and their cycle period is measured in days. One 
reason to study and predict such "mini reversals" is to support traders and 
investors in their analysis and decision making. In particular, the knowledge 
of future reversal times can help financial decision makers in designing safer 
and more effective trading strategies and can be used as a tactical aid in 
implementing specific trades which are motivated by other considerations^] 

Our anchor to the state-of-the-art in predicting turning points is the paper 
by Li et al. upon which we improve and expand. Hypothesizing that 
the underlying price formation process is governed by non linear chaotic 
dynamics, the Li et al. paper proposes a model for short term prediction using 
neural networks. They reported on prediction performance that gives rise to 
outstanding financial returns through a simple trading strategy that utilizes 
predictions of turning points. Li et al. casted the turning point prediction 
problem as an inductive regression problem whose feature vectors consist of 
small windows of the most recent prices. The regression problem was defined 
through a novel oscillator for turning points that quantifies how close in 
hindsight a given price is to being a peak or a trough. Using feed-forward back 
propagation, they regressed the features to corresponding oscillator values 
and learned an ensemble of networks. Prediction of the oscillator values 
was extracted from this ensemble as a weighted decision over all ensemble 



Notice also, that the inverse of an (accurate) turning point predictor is, in fact, a trend 
continuation predictor. 
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members. 

Our contribution is two-fold. First, we replicate the Li et al. method and 
provide an in-depth study of their approach. Our study invalidates some 
of their conclusions and confirms some others. Unfortunately, we find that 
their numerical conclusions, obtained for a relatively small test set (spanning 
60 days), are too optimistic. We then consider a different learning scheme 
that in some sense simplifies the Li et al. approach. Instead of ensemble 
of neural networks we apply Vapnik's support vector regression (SVR). This 
construction is simpler in various ways and already improves the Li et al. 
results by itself. An additional improvement is achieved by considering more 
elaborate feature vectors, which in addition to price data also include the 
Fourier coefficients (amplitude and phase) of price. The overall new model 
exhibits more robust predictions that outperform the Li et al. model. While 
the resulting model does exceed the buy-and-hold benchmark in terms of 
overall average return, this difference is not statistically significant. However, 
the model's average Sharpe ratio is substantially better than the buy-and- 
hold benchmark. 



2. Related work 

Turning points have long been considered in various disciplines such as 
finance and economics, where they are mainly used for early identification of 
business cycles, trends and price swings. Among the first to study turning 
points were Burns and Mitchell jjij], who defined a turning point in terms of 
business cycles with multiple stages. Bry and Boshan j3] proposed a proce- 
dure for automatic detection of turning points in a time series in hindsight. 
Many of their successors refined this scheme, e.g., Pagan and Sossounov j^]. 

Such methods for turning points detection (in hindsight) facilitated the 
foundation of the study of turning points as events to which empirical prob- 
ability can be assigned and statistical analysis can be performed. Wecker 
j6[ developed a statistical model for turning points prediction while utilizing 
some of the ideas of Bry and Boshan. Another approach was proposed by 
Hamilton where he modeled turning points as switches between regimes 
(trends) that are governed by a two-state Markov switching model. The 
model was extended in (sj to include duration-dependent probabilities. 

There are few works that specifically discuss turning points in stock prices. 
Lunde and Timmerman [i| applied a Markov Switching model with duration 



dependence for equities. Bao and Yang [lOj applied a probabilistic model 
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using technical indicators as features and turning points as events of interest. 
Azzini, et al. ll| used a fuzzy-evolutionary model and a neuro-evolutionary 
model to predict turning points. The present paper is closest to the work 
of Li et al. 0], who proposed to use chaotic analysis and neural networks 
ensembles to forecast turning points. 



3. Preliminaries 

Let X = xi, x 2 , ■ ■ ■ , x t , . . . be a real sequence, x t G R. In this paper 
the elements Xt are economic quantities and typically are prices of financial 
instruments or indices; throughout the paper we call Xt prices and the index t 
represents time measured in some time frame. Our focus is on daily sequences 
in which case t is an index of a business day; our results can in principle be 
applied to other time frames such as weeks, hours or minutes. Given a price 
sequence X, we denote by X% + =£$,..., xi+n a consecutive subsequence of 
N prices that starts at the ith day. 

Throughout the paper we will consider autoregressive prediction mech- 
anisms defined for price sequences X. For each day t we consider a recent 
window of m prices, W t = X^Z m called the backward window of day t. The 
prices in the backward window may be transformed to a feature space of 
cardinality n via some encoding transformation. 

3.1. Turning points and their properties 

Let X be a price sequence. A turning point (TP) or a pivot in X is a time 
index t where a local extremum (either minimum or maximum) is obtained. 
A turning point is called a peak if it is a local maximum, and a trough if it is 
a local minimum. Examples for peaks and troughs are shown in Figure [TJ 

Ignoring commissions and other trading "friction," a trader who is able to 
buy at troughs and sell at peaks, i.e., to enter/exit the market precisely at 
the turning points, would gain the maximum possible profit. For this reason, 
successful identification and forecasting of turning points is extremely lucra- 
tive. However, even if all troughs and peaks were known in hindsight, due 
to friction factors including commissions, bid/ask spreads, trading liquidity 
and latency, attempting to exploit all fluctuations including the smallest ones 
may result in a loss. Therefore, one of the first obstacles when attempting to 
exploit turning points is to define the target fluctuations we are after, so as to 
ignore the smaller sized ones. To this end, we now consider three definitions, 
each of which can quantify the "size" of turning points. 
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Figure 1: Examples for peaks and troughs 



Definition 1 (Pivot of degree K). The time index t in a time series is 
an upper pivot or a peak of degree K, if for all j = 1,2, ...,K, Xt > Xt-j 
and Xt > Xi + j . Similarly, t is a lower pivot or a trough of degree K , if for 
j = 1, 2, K, x t < x t -j and x t < x t+j . 

A trough of degree 10 is depicted schematically in Figure [2a] A turning 
point is any pivot of degree at least 1. By definition, at time t one has 
to know the future evolution of the sequence for the following K days in 
order to determine if t is a pivot of degree K. Typically, pivots of higher 
degree correspond to larger price swings. Therefore, such pivots are harder 
to identify in real time. 

The following two definitions consider two other properties of pivots that 
reflect their "importance" . These properties will be used in our applications. 
Definition [2] is novel and Definition [3] is due to 01. 
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Definition 2 (Impact of a turning point). The upward impact of a trough 
t is the ratio max{i t , . . . ,x n }/x t , where n is the first index, greater than t, 
such that x n < Xt- That is, if the sequence increases after the trough t to 
some maximal value, x max , and then decreases below Xtj the impact is the 
ratio x max /xt- If Xt is the global minimum of the sequence, then the numer- 
ator is taken as the global maximum appearing after time t. The downward 
impact of a peak is defined conversely. 

Definition 3 (Momentum of a turning point [2]). The upward momen- 
tum of a trough t with respect to a lookahead window of length w is the 
percentage increase from xt to the maximal value in the window X^f™ = 
x t+ i, . . . , x t +w That is the upward momentum is max{%i, . . . , x t+w }/x t ■ 
The downward momentum of a peak is defined conversely. 




10 points 10 points wdays 



(a) Pivot point (trough) of(b) Turning point of im-(c) Turning point with mo- 
degree > 10 pact 7 mentum 7 with respect 

to a lookahead window of 

length w 

Figure 2: Turning points schematic examples 

In Figure [2] we schematically depict these three characteristics (i.e., pivot 
degree, impact and momentum). Then, in Figure |3] we show examples on a 
real price sequence. Notice, that these definition give rise to quite different 
turning points identification. 

3.2. Alternating pivots sequence 

Given a time series X and required characteristics of turning points (e.g., 
pivots of degree 10, or pivots with impact 7, etc.) we would like to extract 
from X an alternating sequence A(X) of turning points. The sequence A(X) 
is then used to construct a turning point oscillator (in Section [373]) . Following 
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(a) Turning points (pivots) of degree K=10 



(b) Turning points with impact 7 
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(c) Turning points with momentum 7= 
10% with respect to a lookahead window 
of length w=6 days 



Figure 3: Examples of turning point types over the DJIA, 5/2008-5/2009 
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we require that the alternating sequence A(X) will satisfy the following 
requirements: 

1. Only pivots with the required characteristics will be included in A(X). 

2. The pivot sequence will alternate between peaks and troughs. 

3. With the exception of first and last elements, every trough will corre- 
spond to a global minimum in the time interval defined by the pair of 
peaks surrounding it, and vice versa - every peak is a global maximum 
in the time interval defined by the troughs surrounding it. 



In Appendix A we present an algorithm that extracts an alternating pivot 
sequence that satisfies the above conditions. In Figure H] an example is given, 
showing three steps of this algorithm corresponding to the above three re- 
quirements. The proposed algorithm is by no means the only way to compute 
a proper alternating sequence. We note, however, that any algorithm that ex- 
tracts an alternating pivot sequence as defined above must rely on hindsight. 
Therefore, in real time applications the use of the algorithm is restricted to 
training purposes. 

When extracting alternating pivot sequences, different requirements will, 
of course, result in different pivot sequence. Figure depicts the resulting 
alternating pivot sequences for three different pivot requirements, when the 
input sequence X consists of 100 days the Dow Jones Industrial Average 
(DJIA) index from 05/11/2004 to 1/4/2005. 

3. 3. A turning points oscillator 

Unlike typical regression problems, where one is interested in predicting 
prices themselves, when considering turning points, it is not clear at the out- 
set what should be the target function. Here we adapt the solution proposed 
by Li et al. model j^j. Fixing a class of pivot points (satisfying any desired 
characteristic), the idea is to construct an "oscillator," whose swings cor- 
respond to price swings and its extrema points corresponds to our turning 
points in focus. The oscillator essentially normalizes the prices so as to as- 
sign the same numerical value (0) to all troughs and the same value (1) to 
all peaks. This oscillator provides the target function to be predicted in our 
regression problem. The construction of the oscillator is based on alternating 
pivot sequences as discussed in Section 13.21 Throughout the paper, when- 
ever we consider a pivot sequence, the meaning is that we refer to pivots of 
a certain type without mentioning this type. In the empirical studies that 
follow, this type will always be specified. 



S 
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(a) Find all the turning points of momentum 
7 = 0.05, w = 6 




(b) Ensure alternation of peaks and troughs. 
Points 2,5,6,7 are eliminated 




(c) A peak should represent the highest 
point between the troughs. Point 4 is re- 
placed with point 5 maximal TP with 
the defined properties between the troughs 
3 and 8 

Figure 4: Turning points selection example for momentum turning points 
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(c) Turning points of momentum 7 = 0.03, 
lookahcad p = 6 



Figure 5: Examples of turning points for different pivot requirements. The turning points 
that satisfy a requirement are denoted with circles 

Definition 4 (Turning point oscillator (TP Oscillator)). Let X be a 

price sequence and let A(X) be its alternating pivots sequence (that is, A(x) 
is the list of turning point times in X). The TP Oscillator is a mapping 
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r:N-> [0,1], 



T(t) 



x t -P(t) 



if t is a trough, 
if t is a peak, 

otherwise, 



(1) 



P(t)~T(t) ' 

where P(t) and T(t) are the values of the time series at the nearest peak and 
trough located in opposite sides of time t. 

Notice, that for each time index t, the TP Oscillator represents the degree 
of proximity of the price x t to the price at the nearest peak or trough. Prices 
that are closer to troughs will have lower values and prices that are closer to 
peaks will have higher values. The TP Oscillator is clearly bounded in the 
interval [0, 1] and attain the boundary values at troughs and peaks. 

The structure of the TP oscillator strongly depends on the type of turning 
points used in its construction. For instance, in Figure E] we see examples 
of the TP Oscillator computed for turning points with impact 7 = 1% (|6aj) . 
and impact 7 = 5% ( l6b|) . As should be expected, there are more peaks and 
troughs in Figure (|6a|) than in Figure (l6b|) because the number of pivot points 
of smaller impact is larger, so the TP oscillator attains its extreme values 
more frequently. 





40 60 

TP Oscillator 



so 100 



(a) 7 = 0.01 (b) 7 = 0.05 

Figure 6: TP oscillator for impact turning points 
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4. On the choice of features for turning points prediction 



In many prediction problems the choice of features is a crucial issue with 
tremendous impact on performance. From a learning theoretic perspective 
this choice should be done in conjunction with the choice of the model class. 
The overall representation (features plus model class) determines learnability 
and predictability. Since we focus in this paper on autoregressive models, the 
features we consider are limited to multivariate functions of past data. 

In the Li et al. paper ji[ , on which we build, it was advocated that price 
evolution is the outcome of a nonlinear chaotic dynamics and therefore, they 
used tools from chaos theory to determine the length of the backward window 
of prices from which to generate the features. The features themselves where 



simply normalized prices. Specifically, based on Takens theorem [12j, the 
length of this backward window was derived as the minimum embedding 
dimension of the training price sequence. The TP Oscillator used by Li et 
al. was defined in terms of momentum turning point. Recalling Definition [21 
momentum pivots are characterized via two parameters: the 'importance' 
parameter 7, and the lookahead window length. Li et al. focused on short 
term oscillations, and the choice of the corresponding lookahead window 
length was based on the Lyapunov exponent^ It is perhaps surprising that 
their chaotic dynamic analysis resulted in a conclusion that the past eight 
prices are sufficient for predicting the turning points. 

We decided not to take the Li et al. choice of representation for granted 
and performed an initial study where we considered several types of features 
and backward window lengths. In our study we employed a 'wrapper' ap- 
proach 13| where we quantified the performance of the features within the 
entire system (so that trading performance determined the quality of the 
features) . 

The conclusion of our study was that the better performing features 
(among those we considered) are normalized prices within a backward win- 
dow as well as the Fourier coefficients of those prices. The Fourier coefficients 
were simply the phase and amplitude coefficients resulting from a standard 
application of the discrete Fourier transform over the backward window. Our 
initial study confirmed that the better performing backward window length 



2 In chaotic dynamical systems, the horizon of predictability, which is directly affected 
sensitivity to initial conditions, is inversely proportional to the maximal Lyapunov expo- 
nent. 
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is eight, exactly as the Li et al. conclusion. 



5. Predicting turning points with Support Vector Regression 

In this section we describe our application of Support Vector Regression 
(SVR) to predict the TP oscillator (presented in Section [373]) . We refer the 



unfamiliar reader to 14| and 15| for comprehensive expositions of SVR. 

The SVR application is as follows. We train an SVR to predict the 
TP Oscillator T(t) as a function of the features, which are extracted from 
price information contained in the most recent backward window. Denote 
by F : W 71 — > M. d the feature generating transformation from prices in the 
backward window W t = X^Z m to the feature space. Thus, F{W t ) is a d- 
dimensional feature vector. The specific transformation F we used, consisted 
of normalized prices combined with normalized Fourier coefficients as de- 
scribed in Section HJ The label corresponding to the feature vector F(Wt) of 
the t th backward window, is set to T(t), the TP Oscillator at time t. This way 
we consider a training set S n = (F(W t ), r(t)), t\ < t < £2, of n = t 2 — ti + 1 
consecutive pairs of features with their labels. 

Using S n we train a support vector regression model. To this end we use 
an e-SVR model (and a training algorithm) [l6[ using a radial basis function 
(RBF) kernel, K(xi,Xj) = exp(— -^\\xi — Xj\\ 2 ). The model is controlled 
by three hyper-parameters C, the error cost, e the tube width, and a, the 
kernel resolution. The reader is referred to 1|1 for a discussion on the role 



of these parameters. The output of the SVR training process is a function 
r(t) : M. d — ?• R, from feature vectors to the reals. In our case, the function T 
is the SVR functional approximation to the TP oscillator, induced from the 
training set. Thus, larger values of T reflect closer proximity to peaks, and 
smaller values, closer proximity to troughs. 

Since T(t) is only an approximation of T(t) reflecting relative proximity to 
extrema points, we cannot expect that its own extrema points will explicitly 
identify the peaks and troughs themselves. Therefore, in order to decide 
what are the magnitudes of the predicted values that should be considered 
as peaks or troughs, we introduce thresholds Ti ow , T high so that indices t such 
that T(t) < T [ow , are all treated as troughs, and conversely, indices t for 
which T(t) > T high , are all considered as peaks. The thresholds Ti ow and 
Thigh are hyper-parameters that are part of our model and should be fitted 
using labeled training data. 
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5.1. Problem specific error function and optimization 

Various error functions are used in regression analysis. The most common 
ones are the root mean square error (RMSE) and the mean absolute error 
(MAE). For example, the RMSE of the prediction T(t) over the subsequence 
X l t l = x tl , . . . , x t2 of n = t 2 — ti + 1 inputs, is given by, 



Instead of directly using the RMSE, Li et al. |2j suggested to use a problem 
specific variant of the RMSE. This specialized error function is defined in 
terms of the following trimmed reference function, 



The final specialized error function, denoted TpRMSE, is then defined 
in terms of the reference function as, 



TpRMSE = TpRMSE(r(t),r(t),T low , T high ) 4 (I ^(f (t) - f(t)) 2 ) *. 



We observe that T (t) — f (t) has non-zero values at values t corresponding to 
wrong predictions, thus allowing to form an error function that penalizes such 
deviations according to their magnitude. An example shown the TpRMSE 
error function is given in Figure [7J 

Remark 5. Li et al. JgJ proposed to use the following refinement of the 
TpRMSE error function, which assigns larger weights to errors occurring 
precisely at true turning points, and smaller weights for other errors, 



1 i 
RMSE = RMSE(t(t),T(t), X^) = (- ^(f(t) - T(t)) 2 ) 5 . 



r'(t)Mi; 




TpRMSE^ (±^A(f'(t)-f(t)) 2 )*, 
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Figure 7: Calculation of TpRMSE. The upper graph shows the values of the actual TP 
Oscillator T(t) and the predicted TP Oscillator T(t). The lower graph identifies regions 
that actually influence the error magnitude, expressed at each point as (T'(t) — T(t)) 2 . 
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The exact choice of the (3 coefficients can be found in J|]/. In our experiments, 
whenever we applied the Li et at model we used this refined TpRMSE error 
function. However, we did not find any significant advantage to this refined 
error function and our SVR model was applied with the simpler TpRMSE 
error function. 

Given a fixed SVR regression estimate T(t), we need to optimize the thresh- 
olds Thi g h and Ti ow , so as to minimize the TpRMSE error. The optimal 
thresholds are thus given by, 

[T low , T high ] = argmin TpRMSE(T(t), f (t), T^, T Mgh ). (2) 

t! t! 

low'' high 

Our overall turning point SVR model is specified by the following hyper- 
parameters. The SVR hyper-parameters are C, e and a, and the turn- 
ing point identification hyper-parameters are Ti ow and Thigh- These hyper- 
parameters need to be optimized over the training sequence. To this end, 
we split the training sequence X into two segments, one for optimizing the 
SVR parameters and the other, for optimizing the thresholds (in the sequel 
we call this second segment of the training sequence the validation segment). 
Due to the complexity of the error functional we resorted to the exhaus- 
tive grid search. Specifically, both sets of hyper-parameters were selected 
exhaustively over appropriate set of values (a grid), so as to minimize the 
error functional. The SVR grid is denoted 0(C, a, e) and the thresholds grid 
is denoted Q(Ti ow , Thigh)- In Section [931 we discuss the particular choices of 
these grids. The overall model selection strategy is summarized in Algorithm 

m 

Algorithm 1 SVR TPP model 
for all (C,a,e) i Q(C,a,e) do 

Train an SVR model on the training data segment 

Optimize {T low ,T hi gh) <E Q{T low ,T hi gh) on the validation segment using 

Store TpRMSE and (Ti 0W ,T h i g h) for the given triple (C,a,e) 
end for 

Select the (C, a, e) obtaining the minimal TpRMSE score on the validation 
sequence with their corresponding optimal thresholds (Ti ow , Thigh)- 



We note that Li et al. |2[ applied a genetic algorithm to optimize the 
corresponding thresholds (Ti ow , Thigh) in their model. However, we found 
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that the exhaustive search over a grid performs the same or better than a ge- 
netic algorithm, with tolerable computational penalty. The main advantage 
in using our deterministic approach, rather than the randomized genetic al- 
gorithm, is the increase of reproducibility (our optimization is deterministic 
and always has the same outcome). 

6. A trading application 

Prediction algorithms can be evaluated through any meaningful error 
function. For example, we could evaluate our methods using the TpRMSE 
function defined above. However, the degree of meaningfulness of an error 
function depends on an application. Perhaps the most obvious application of 
turning points predictions is trading. We now describe a very simple trading 
strategy implementing the infamous buy low sell high adage. This strategy 
will be used to assess performance of our method, including comparisons to 
a natural benchmark and to the state-of-the-art model of [2| . 

The idea is to use predictions from our already constructed regressor, T, of 
the TP Oscillator, T, to issue buy and sell signals. Given the prediction T(t) 
and the thresholds Ti ow and Thigh, trading operations are triggered according 
to the following rule. 



The strategy thus works as follows. If today we are not in position (i.e., we 
are out of the market) and a buy signal is triggered at the market close, we 
enter a long position first thing tomorrow, on the opening price. We start 
this trade by buying the stock using our entire wealth. As long as we are in 
the trade, we wait for a sell signal, and as soon as it is triggered, we clear 
our position on the opening of the following day. (i.e., clearing the position 
means that we sell our entire holding of the stock). 

To evaluate trading performance we will use cumulative return, maximum 
drawdown, success rate and the Sharpe ratio measures. These are standard 
quantities that are often used to evaluate trading performance. To define 
these measures formally, let x^ +n be a price sequence of length n, and let 
{{pi, Si), hi, Si E [t,t + n],i = 1, . . . , L}, be L pairs of times corresponding to 
matching buy and sell "triggers" generated by a trading strategy S. This 
pairs correspond to L buy/sell trades. Thus, for any i, we bought the stock 
at price and sold it at price x s .. 
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The Cumulative return, rets{x t t ¥n ), of a trading strategy S with respect 
to the price sequence x 1 ^™ , is the total wealth accumulated at the end of a 
test period, assuming full reinvestment of our current wealth in each trade, 
and starting the test with one dollar, 

ret s (x^) = U—- ( 3 ) 

X bi 

The corresponding annualized cumulative return, RET$ {x\ +n ) is@ 

RET s (x? n ) = [ret s (x* +n )] 252/n . (4) 

It is often informative to consider also the rolling cumulative return, 
ROCs(t), which is the curve of cumulative return through time, 

R0Cs(t) J^«tiW ; °l n .« = « + !.-.« + " <«) 

yxt/xt-x, if t e \J{[ b i,Si\) 

The maximum drawdown (MDD), MDDs(x t t +n ), is a measure of risk, 
defined with respect to the rolling cumulative return curve. Consider Figure 
[HI depicting a hypothetical (rolling) cumulative return curve. The MDD of 
this curve is defined to be the maximum cumulative loss from a peak to the 
following minimal trough during a trading period. The MDD is emphasized 
in the Figure [8] by the measured height. Formally, if ROCs{k) is a the 
cumulative return sequence, its MDD in the time interval [t, t + n] is, 

MDD s (x\ +n ) = max { max \ROC s (k) - ROC s (r)}}. (6) 

re[t,t+n] ke(t,r) 

The Sharpe ratio(SR), sharpes( risk-adjusted return measure 

17| . Intuitively it characterizes how smooth and steep the rolling cumulative 
return curve is, 

It is common to annualize SR in order to be able to compare SR for 
different time periods. Thus, for daily data, the annualized Sharpe Ratio 
(ASR) is, 

SHARPE s (x\ +n ) = RET s {x^)-l 

V 1 ' std{ROC s {x\ +n )) ■ V252 V ' 



3 There 252 business days in a year. 
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MDD(t,,U 



Figure 8: Maximum drawdown (MDD) example 



The last metric we introduce is the rate of success, RATEss{x t 
is simply proportion of successful trades, 



RATE Ss {x 



L 



which 



(9) 



As a natural benchmark for trading performance we take the buy-and-hold 
(BAH) strategy, which simply buys the stock at the beginning of the period 
and then sells it in the end of the period. While this strategy is very simple 
(and perhaps naive), it makes much sense in the stock marketQ Calculations 
of the above performance metrics for the BAH strategy are straightforward, 
according to the above formulas ((j3J)) - ((jHJ)) by assigning bi = t, si = t + n, 
and L = 1. 

6.1. Example: predicting turning points with SVR 

In Figure I9"al we demonstrate predictions obtained by our SVR model over 
an out-of-sample (i.e., test) data. In this example we restricted attention 
to turning points with impact 7 = 0.01, and trained the model over one 
year of the DJIA index from 07-Sep-2004 to 09-Nov-2005. The SVR hyper- 
parameters and the thresholds were selected over a validation segment from 
ll-Nov-2005 to 30-Jan-2006, and predictions were performed for an out-of- 
sample test period from 08-Feb-2006 to 25-Apr-2006 (this period is shown in 
the figure). In this example feature vectors consisted of the past eight prices 
normalized to reside in [0, 1]. The SVR hyper-parameters were optimized 



In fact, many economists believe that BAH is the only strategy that makes sense [18] 
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over the grid 6 (C, a, e) = [2~ 10 , 2~ 9 ...2 10 ] x [2~ 10 , 2~ 9 ...2 10 ] x [0.01,0.05,0.1]. 



The best model parameters that were found in this grid were C = 32, 7 = 
0.0625, e = 0.05. The turning points identified by this model are marked 
in the figure. We note that the prediction problem (data and train/test 



periods) in this demonstration is precisely the same as the one used in [2j for 



evaluation. 

7. Predicting turning points with neural networks 

In this section we briefly describe the original turning points prediction 
model of Li et al. jij , which relies on neural networks. In their methods, the 
feature vector consists of normalized prices within a recent backward win- 
dow. The size of this window is determined by calculating the embedding 
dimension over the training sequence. The target function to be regressed 
and predicted is the TP Oscillator restricted to turning points with momen- 
tum 7 and lookeahead window length which is calculated using the Largest 
Lyapunov exponent of the training sequence (see details in jij). The training 
set is constructed as described in Section 19.11 The model trains a number of 
neural networks, which differ in the number of neurons in their single hid- 
den layer. These networks form an ensemble whose prediction is generated 
by aggregating individual networks outputs using their weighted sum. These 
ensemble weights, as well as the prediction thresholds, are optimized to mini- 
mize the TpRMSE function (see Section I5TT1) over a validation sequence. For 
further details on how the backward window size is determined, the reader 
is referred to 0]. We only note that the empirical studies conducted in ^ 
considered two price sequences, DJIA including 411 points, and TESCO con- 
sisting of 492 points. In both cases the backward window length was taken 
to be eight days. 

8. Comparison between the SVR and the ANN methods 

Having described the SVR and the ANN models in Sections [5] and [7J 
respectively, we would like to emphasize the resemblance and differences 
between these two methods. Both methods are focused on predicting turning 
points and in general, we were motivated by the Li et all. paper jij, followed 
their general outline, and drew on a number of their ideas, including the 
utilization of their TP oscillator and their loss function. Our variant differs in 
several aspects. Aside from the different regression algorithm (their ensemble 
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5 10 15 20 25 30 35 40 45 50 



Actual TP Oscillator 

- - Predicted TP Oscillator 



(a) TP Oscillator and its prediction 




(b) Applying TP Oscillator and thresholds for TP prediction. The upper 
chart depicts prices, circles mark the actual turning points and squares 
mark predicted turning points. The lower chart depicts the TP Oscillator 
prediction, the horizontal lines are the thresholds. 

Figure 9: Demonstration of SVR model predictions over an out-of-sample DJIA data 
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of neural networks vs. our support vector regression), our model relies on a 
different feature set and utilizes a different target function. Specifically, the 
feature set in j^] is a window of the past prices. In our case, the features are 
both prices and the Fourier coefficients of the prices. The target function 
used in ^ is the TP Oscillator restricted to momentum turning points of 
particular magnitude (and lookahead window length). In our case it is the 
TP Oscillator of defined with impact turning points of particular magnitude. 
The differences between our approach and theirs are summarized in Table [TJ 





[2] 


Our work 


Feature vector length 


Chaotic embedding dimension 


Optimized length (hyper-parameter) 


Turning point property 


Momentum 


Impact 


Regression algorithm 


Ensemble of neural networks 


Support Vector Regression (SVR) 


Input representation 


Raw prices 


Raw prices and Fourier coefficients 



Table 1: Summary of differences between ANN and SVR models 



9. Experimental design 

We conducted an extensive set of experiments to evaluate the SVR and 
ANN turning points prediction models discussed in Sections |5] and [7J respec- 
tively. Our evaluation is performed in terms of the simple trading application 
introduced in Section UJ We evaluated the models over historical price seg- 
ments of the Dow Jones Industrial Average (DJIA). Since financial sequences 
such as the DJIA exhibit great many behaviors and are extremely noisy, we 
considered in our experiments many predicting tasks, corresponding to many 
sequence segments along the DJIA history. To facilitate the discussion we 
first define and discuss in the following subsections the essential technical 
aspects in our experimental design. 

9.1. Train, validation and test splits 

Each instance of a prediction task is a triplet of price subsequences of 
the financial sequence in question (DJIA) consisting of training, validation 
and test segments, as depicted in Figure [KU Thus, the prices in each of 
the segments are contiguous and the segments follow each other chronolog- 
ically. The training segment is used to fit model parameters (for the SVR 
or ANN models). The validation segment is used for model selection via 
hyper-parameter tuning, and the test segment is utilized to evaluate perfor- 
mance. Each such triplet is called a prediction task. We denote the lengths 
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of the three segments by N train , N va [ id and N test , respectively, and the choice 
of these parameters in our experiments will be discussed later. 



Train se gment Validatio n segment^ Test segment 

~Y Y — Y - 



Figure 10: Splitting data into train, validation and test segments 

9.2. Statistical validity 

Our tests consider multiple methods and multiple prediction tasks. To en- 
sure statistical validity of our conclusions, we utilize the following statistical 
tests. 

9.2.1. Tests for pairwise comparisons 

When comparing the performance of two algorithms, or two instances of 
an algorithm (i.e., the same algorithm applied with different parameters) over 
multiple tasks, we use the Wilcoxon Signed-Rank test, as recommended and 
described in [l_9[ Section 3.1.3. For a pair of algorithms of interest, we conduct 
the test and calculate the p-value. If the the performance difference can be 
accepted at 5% level, we conclude that one of the algorithms is significantly 
better than the other (at 5% significance level); otherwise, we conclude that 
the algorithms are statistically indistinguishable. 

9.2.2. Tests for multiple comparisons 

When comparing a group of algorithms (more than two) over multiple 
tasks we use the Friedman test to determine the rank of each of algorithm. 
In this case the p-value indicates whether differences between the algorithms 
are significant. If significant differences are observed, we proceed with a 
post-hoc test using the Bonferonni-Dunn test for pairwise rank comparisons 
in order to find out the best perfor ming algorithm in the group. The precise 



procedure we follow is described in (19J Section 3.2.2 
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9.3. Software 

All experiments were conducted under Matlab. For training the ANN 
models we used the Matlab Neural Network toolbox. The SVR models were 



computed using the SVMLib toolbox [16 



9-4- Dataset 

Following j2| our data set consisted of close prices of the DJIA indexjf] 
We analyzed a large DJIA segment from 1/1/1960 to 1/1/2010. Spanning 50 
years, this sequence contains 12585 prices. In this time range we selected 300 
prediction (and trading) problem tasks uniformly at random. This quite large 
number of problem instances was chosen to ensure statistical significance. 
In particular, the statistics reported are typically averaged over these 300 
instances. As discussed in Section 19. 1[ each of the 300 tasks is a triplet 
consisting of consecutive train, validation and test segments. The lengths of 
these segments are N train = 504 days (two years), N va i idate = 60 days, and 
Ntest = 60 days. Finally, we note that the 300 test periods in our prediction 
tasks sometimes overlap, but in general they are uniformly spread along the 
50 year periodic 

9.5. Details of the SVR model and its hyper-parameters 

Our SVR model is described in Section [5j To actually apply this model, 
we need to make some choices regarding representation. Specifically, we need 
to choose the type of turning points to focus on, either pivot (of a certain 
degree), impact or momentum, and in each case select desired resolution, 
controlled by the pivot degree or the parameter 7 (see Section I3.ip . The 
SVR model itself is controlled by three hyper-parameters: C, a and e. The 



role of these parameters is described in (15(. Turning point identification is 
achieved using two additional hyper-parameters, T[ ow and Thigh, as described 
in Section [6J The SVR model selection is performed using a straightfor- 
ward grid search for the best triplet of SVR hyper-parameters. This SVR 
parameter space (grid), denoted 0(C, a, e), was chosen to be 

e(C, a, e) = {0.1, 1, 100} x {0.1, 1, 100} x {0.01, 0.05, 0.1}. 



5 We downloaded the prices from Yahoo! Finance, http://finance.yahoo.com. 
6 The precise periods selected for these 300 tasks were recorded and can be obtained 
from the authors. 
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Similarly, the parameter space (grid) for the thresholds was taken to be 



®(T low ,T high ) = [0 1] x [0 1], with step 0.01. 

These choices were made based on a preliminary rough study on other price 
sequences before conducting the experiments and were not optimized there- 
after. 

9. 6. Details of the ANN model and its hyper-parameters 

The ANN model of Li et al. j2j is briefly introduced in Section [7J In 
order to replicate their model as accurately as possible we followed all their 
choices. The parameters for the backpropagation learning algorithm were: 

• Transfer function: hyperbolic tangent; 

• Output function: linear function. 

• Backpropagation learning function: gradient descent with learning rate 
0.01. 

The chaotic characteristics of the time series were used for selecting some of 
the parameters as follows: 

• The embedding dimension m of the training segment was used to deter- 
mine the backward window length. Embedding dimension calculations 



were performed using Cao's method [20J as implemented by Merkwirth 



et al. in their Matlab toolbox 21 



The time delay r was used to determine the sampling rate of the input 
data. The time delay was calculated in accordance with the mutual in- 
formation method described in[22|, and implemented in the Merkwirth 
et al. Matlab toolbox as well |2l| . 

The largest Lyapunov exponent A was used to determine the lookahead 
window length (required to define the impact turning points), and was 
set to 1/A, as was done in jij. This Lyapunov exponent was calculated 



using Rosenstein et al's method [23| as implemented by Hegger et al. in 



their TISEAN tool [24|. All these chaotic parameters were calculated 



over the training data segment. 
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Since the neural network training algorithm starts with random initial 
network weights, we performed standard 'random restarts' to initialize these 
weights. We used 10 random restarts for the training of each prediction 
task as is common in practice. Thus, we obtained 10 sets of results for each 
instance and selected the best performing one over the validation segment. 

With these methods for parameter selection we were able to reproduce the 
results of j^j for the particular prediction task (i.e., a particular training/test 
segment) used in their paper for evaluation, (this prediction task is used 
in our example of Section 16. ip . Since Li et al. used only one DJIA task 
for evaluation, one of our contributions is a more through analysis of their 
method using multiple tasks. 

9.7. Results 

In this section we present the results of our experiments. Throughout 
the presentation, SVR refers to the proposed model and ANN refers to the 
original method of 

9.7.1. Experiment 1: SVR vs. ANN vs. BAH 

The first set of results is a comparative study of SVR and ANN. For both 
methods, we experimented with two types of target functions. The functions 
we considered in this studv were generated using either momentum pivots 
(as originally proposed in |2j) of varying degrees, or impact pivots of varying 
degrees. The other parameters of these experiments are summarized in Table 

m 

ANN SVR 

Features Raw prices Raw prices and Fourier coefficients 

Input length Chaotic 8 

Time delay Chaotic 1 

Look-ahead Chaotic 

Ntrain (training segment length) 2 years 2 years 

N V aiidate (validation segment length) 60 days 60 days 

Ntest test segment length 60 days 60 days 

Table 2: Experiment 1: Parameters of the SVR and ANN models. The specification 
'Chaotic' in the ANN column means that the parameters were selected based on chaotic 
dynamics analysis as described in Section 

The results of the experiments are presented in Table IB. 121 The table 
summarizes the performance obtained with the two types of pivot points 



26 



(momentum and impact). For each pivot type we show the performance for 
a number of 7 values (recall that 7 specifies the resolution or "importance" 
of the pivot). At the lower part of the table the performance of buy-and- 
hold (BAH) is specified. It is evident that both methods suffer from greater 
values of 7 (5% and 10%) and very small values of 7(0.1%). Specifically, for 
these 7 values the mean returns are lower than the corresponding return of 
BAH, and the Sharpe ratios do not exceed by much the Sharpe ratio of BAH. 
With lower values of 7 (1% and 2%) ANN still fails to beat the BAH mean 
return, while SVR is better than the BAH in average, but this advantage 
is not statistically significant according to Wilcoxon signed rank test whose 
results are summarized in Table IB.8I in the appendix. 

With impact pivot points both ANN and SVR achieved good Sharpe 
ratios that are better than the BAH Sharpe ratio. This advantage is statisti- 
cally significant. In addition, SVR is superior to ANN in terms of both mean 
return and the mean Sharpe ratio. When considering the MDD measure, we 
observe that both SVR and ANN achieved better (smaller) MDD than the 
BAH. The best performing 7 in terms of Sharpe ratio is 7 = 2% (for both 
types of pivot types). This emphasizes that the proposed models are more 
useful for shorter term prediction of smaller sized fluctuations. 
Remark: The p- values corresponding of the Wilcoxon signed-rank test of 
this experiment appear in Table IB.8I in the Appendix. 

9.7.2. Experiment 2: SVR Backward window size 

We tested the SVR performance for short (4), medium (8) and large 
(50) backward windows lengths. Table H] presents the results of these tests 
for three types of turning points (2%-momentum, 2%-impact and pivots of 
degree 10). Table IB. 91 in the appendix shows a comparison of the SVR 
return, success rate, maximum drawdown and Sharpe ratio, for the three 
backward window length, with the corresponding metrics of BAH. that table 
also includes the Friedman ranks statistics of these tests. From this compar- 
ative analysis we can conclude that the small and medium backward window 
lengths allow SVR to perform significantly better than BAH in terms of 
Sharpe ratio. However, according to the Friedman ranks analysis, small and 
medium ASR belong to the same performance group, this means that no 
statistically significant difference between them was detected in the post-hoc 
test. 
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Mean Return^ 


Success Rate 


Mean MDD 


Mean ASR 


Turning points of impact 7 


7=0.1% 










SVR 


1.28% ±0.30% 


68.02% ± 1.66% 


2.17% ±0.20% 


0.94 ±0.13 


ANN 


1.10% ±0.25% 


61.08% ±0.97% 


1.75% ±0.12% 


0.91 ±0.12 


7=1% 










SVR 


1.81%±0.28%(*) 


69.89% ± 1.83% 


1.92%±0.18% 


1.34±0.12(*) 


ANN 


1.06% ±0.22% 


62.76% ± 1.27% 


1.98%±13.00% 


1 U-Lf| 11 
_L . ±4: zn U.ll 


7=2% 










SVR 


1.77% ± 0.22%(*) 


73.82% ± 1.87% 


1.73%±0.18% 


1.67±0.13(*) 


ANN 


1.02% ±0.18% 


66.17% ± 1.39% 


1.91%±0.15% 


1.13±0.10 


7=5% 










SVR 


0.59% ± 0.20% 


64.58% ±2.31% 


1.37%±0.18% 


1.26±0.13 


ANN 


0.65% ±0.13% 


63.47% ± 1.79% 


1.27%±0.12% 


1.04 ±0.11 


7=10% 










SVR 


0.54% ±0.16% 


76.10% ±2.31% 


0.63%±0.13% 


1.79 ±0.12 


ANN 


0.37% ± 0.09% 


61.27% ±2.04% 


0.57%±0.07% 


0.81 ±0.11 


Turning points of momentum 7 with respect to lookahead window w 




7 =0.1% 










SVR 


1.11% ±0.28% 


67.11% ± 1.65% 


2.22% ±0.20% 


0.82 ±0.12 


ANN 


0.98% ± 0.24% 


61.21% ±0.98% 


1.71% ±0.12% 


0.72 ±0.11 


7=1% 
SVR 


1.46%±0.28%(*) 


67.81% ±2.01% 


1.65%±0.18% 


1.26±0.13(*) 


ANN 


1.07% ±0.22% 


62.26% ± 1.27% 


1.90%±13.00% 


1.03 ±0.11 


7=2% 










SVR 


1.43%±0.26%(*) 


69.97 ±2.26% 


1.64%±0.17% 


1.4±0.13(*) 


ANN 


0.97% ±0.18% 


65.82% ± 1.39% 


1.67%±0.15% 


1.09 ±0.10 


7=5% 










SVR 


0.82% ±0.15% 


75.47% ± 2.20% 


0.65%±0.11% 


1.63 ±0.13 


ANN 


0.48% ±0.13% 


58.51% ± 1.79% 


1.09%±0.12% 


0.51 ±0.11 


7 =10% 










SVR 


0.23% ± 0.06% 


77.59% ± 2.35% 


0.23%±0.07% 


1.44 ±0.09 


ANN 


0.37% ± 0.09% 


55.55% ± 2.04% 


0.61%±0.07% 


0.69 ±0.11 



BAH mean return: 1.44 ± 0.44% 
BAH mean MDD:8.2 ± 0.41% 
BAH mean ASR:1.01 ± 0.15% 



Table 3: Performance comparison between the SVR and ANN models for turning points 
with impact and momentum characteristics. Whenever a result of one of the methods is 
better at 5% level, according to Wilcoxon signed-rank test, we mark it with (*). Boldface 
numbers mark results that exceed the corresponding BAH metric at 5% level. 
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Backward window Mean return Success Rate Mean MDD Mean ASR 

Turning points with impact 7 = 2% 

4 1.95% ±0.28% 65.84% ±1.74% 2.09%±0.22% 1.36±0.12 

8 1.77% ±0.22% 73.82% ±1.87% 1.73%±0.18% 1.67±0.12 

50 1.36% ±0.22% 70.44% ±2.27% 0.95%±0.14% 1.36 ±0.13 



Turning points with momentum 7 = 2%, lookahead window 6 days 

4 1.59% ±0.27% 66.41% ±1.71% 1.99%±0.17% 1.21±0.13 

8 1.43% ±0.26% 69.97 ±2.26% 1.64%±0.17% 1.4±0.13 

50 1.39% ±0.22% 69.90% ±2.32% 0.93%±0.14% 1.77 ±0.13 



Pivot turning points of degree 10 

4 1.21 ±0.28% 64.76 ±1.92% 0.11±0.00 2.32±0.22% 

8 1.23 ±0.20% 69.97 ±2.26% 0.26±0.12 1.33±0.15% 

50 0.35 ±0.31% 53.87 ±2.59% 0.09±0.00 1.50±0.25% 



BAH mean return:1.44± 0.44% 
BAH mean MDD:8.2 ± 0.41% 
BAH mean ASR:1.01 ± 0.15% 



Table 4: Performance dependence on backward window length. Boldface numbers are 
significantly better than the corresponding BAH metrics. The p-values of the pairwise 
tests with BAH and Friedman ranks are summarized in Table IB. 91 
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9.7.3. Experiment 3: SVR training segment length 

The goal of this experiment was to see how performance is influenced 
by the training segment length. To this end, we considered three lengths: 
short (0.5 year), medium (1 and 2 years) and long (5 years). The results are 
presented in the Table Based on the statistical analysis of these results 
(summarized in Table IB. 101 in the appendix), we conclude that the algo- 
rithm exhibits similar performance for these four training segment lengths, 
and this holds for the three types of pivot points (pivot degree, momentum 
and impact). While the average Sharpe ratios and returns are higher for 
longer training segments, the Friedman rank analysis cannot designate these 
differences statistically significant. 



Training set length Mean return 



Success Rate 



Mean MDD Mean ASR 



Impact turning points 

0.5 year 1.77% ±0.25% 66.84%±2.19% 

1 year 1.47% ±0.22% 70.95%±2.05% 

2 years 1.77% ±0.22% 73.82%±1.87% 

5 years 2.12% ±0.26% 73.92%±1.97% 



1.86%±0.18% 
1.79%±0.18% 
1.73%±0.18% 
1.90%±0.19% 



1.36±0.13 
1.38±0.12 
1.67±0.12 
1.67±0.12 



Momentum turning points 

0.5 year 1.07% ±0.27% 68.00% ±1.90% 1.66%±0.18% 1.23±0.13 

1 year 1.42% ±0.27% 68.52% ±1.80% 1.66%±0.17% 1.25±0.13 

2 years 1.43% ±0.26% 69.97±2.26% 1.64%±0.17% 1.4±0.13 

5 years 1.36% ±0.28% 65.44% ±2.13% 1.63%±0.19% 1.15±0.13 



Pivot degree turning points 

0.5 year 0.63% ± 0.20% 63.63% ±2.38% 1.63%±0.20% 0.96±0.12 

1 year 1.07% ±0.24% 65.97% ± 2.42% 1.72%±0.20% 1.23±0.13 

2 years 1.23% ±0.20% 69.97% ±2.26% 1.33%±0.15% 1.45±0.12 

5 years 1.08% ±0.23% 70.18% ±2.32% 1.44%±0.18% 1.35±0.14 



BAH mean return:1.44± 0.44% 
BAH mean MDD:8.2 ± 0.41% 
BAH mean ASR:1.01 ± 0.15% 



Table 5: Performance dependence on training segment length 
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9.7.4- Experiment 4'- reproduction of the Li et al. results in jl/ 

In this experiment the goal was to reproduce the results presented in |2| 
for the original ANN model with respect to the DJIA and TESCO price se- 
quences. Due to the use of randomization in the backpropagation training 
algorithm (used for random assignment of initial weights), we ran each experi- 
ment 20 times in order to receive a robust estimation of the result. The results 
are presented In Table 19.7.41 (containing sub-tables (a) for DJIA, and (b) for 
TESCO). All these experiments consider the same training/validation/test 
partition |^1 

As expected, due to this randomized ANN training, the 20 the returns 
obtained (for each dataset) vary considerably. The results reported in (ij are 
quite close to the best return among the 20. Specifically, Li et al. report on 
6.21 return for DJIA (vs. our 5.97), and 11.63 return for TESCO (vs. our 
10.94). 

The average returns we obtained in both experiments are, unfortunately, 
not as optimistic as reported in j2| and, in particular, in our experiments the 
ANN model could not outperform BAH. Using the SVR model over these test 
segments we obtained a return of 2.31% for DJIA and 7.83% for TESCO. 
These results are robust since no random selections are made in our SVR 
training and prediction. 

9.7.5. Experiment 5: ANN ensemble size 

The goals of this experiment were to test how the ANN performance is 
influenced by the ensemble size. In particular, we were also interested to see 
if a single network can perform as well as an ensemble. In the first experiment 
we ran the ANN model with a single network, with varying number of neurons 
in the hidden layer. In the second experiment we ran the ANN model with 
different ensemble sizes. In this case, the number of neurons in the hidden 
layer of each ensemble member was taken from the interval [m, m + n] where 
m is the backward window length and n is the ensemble sizejj 

Table [7] summarizes the performance of these settings. These results con- 



8 For the DJIA sequence, the training period is 7/9/2005-10/11/2005, the validation 
period is 11/11/2005-29/1/2006, and the test period is 30/1/2006-25/4/2006; for TESCO 
sequence the training period is 18/11/2004-31/-5/2006, the validation period is 1/06/2006- 
26/07/3006, and the test period is 27/7/2006-9/10/2006. 

9 This choice was not explicitly mentioned in |2j but was obtained from Li et al. by 
personal communication. 
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Return(%) 


Success Rate 


1.05 


66.67% 


3.85 


100.00% 


3.05 


100.00% 


2.91 


100.00% 


4.68 


75.00% 


3.26 


75.00% 


5.57 


100.00% 


3.82 


66.67% 


2.56 


66.67% 


4.50 


100.00% 


5.97 


100.00% 


2.80 


100.00% 


2.35 


100.00% 


5.42 


100.00% 


1.91 


66.67% 


3.90 


100.00% 


2.96 


75.00% 


0.81 


66.67% 


4.14 


100.00% 


3.85 


75.00% 


Average TPP return:1.39% ± 0.31% 
BAH: 3.91% 



(a) DJIA dataset 



Table 6: Attempting to replicate the Q ri 
DJIA and TESCO datasets 



Rcturn(%) 


Success Rate 


0.00 


NaN 


6.86 


100.00% 


8.86 


100.00% 


0.00 


NaN 


7.96 


100.00% 


3.53 


100.00% 


9.42 


100.00% 


0.00 


NaN 


0.00 


NaN 


3.46 


100.00% 


7.62 


100.00% 


0.00 


NaN 


0.00 


NaN 


0.00 


NaN 


7.06 


100.00% 


1.59 


100.00% 


5.82 


75.00% 


10.94 


100.00% 


7.89 


100.00% 


1.73 


100.00% 


Average TPP rcturn:4.14% ± 0.85% 
BAH: 6.37% 



(b) TESCO dataset 



is: 20 iterations of the ANN model on the 



firm that an ensemble, containing multiple networks outperforms a model 
with a single network. As for the ensemble size, in a direct comparison, 
no statistically significant difference between ensembles of sizes 2, 5, 10 or 
50 members was observed. Nevertheless, the ensemble of size 10 excelled, 
compared to the other three ensembles, in that it was the only one which 
outperformed BAH in terms of Sharpe ratio in a statistically significant man- 
ner (ensembles with 5,10 and 50 members obtained better Sharpe ratio than 
BAH). 

9. 7. 6. Some caveats regarding performance calculation 

Here we discuss caveats that should be accounted for in examining the 
above results. Our experimental protocol examined the trading performance 
during numerous test segments each of which fixed in length (60 days). Each 
trade is completed only when both entry and exit signals are generated. The 
first problematic situation occurred in instances where an entry signal was 
generated during the test segment but no matching exit signal was obtained 
during the same test segment. We could treat such instances in various ways 
(e.g., discard the trade, end it prematurely, or follow it till it end in another 
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Mean return Success Rate Mean MDD Mean ASR 



Single network 



Neurons number 
2 0.09%±0.07% 
5 0.30%±0.12% 
10 0.68%±0.31% 
50 0.88%±0.42% 



64.58%±6.29% 
58.90%±5.11% 
64.46%±3.42% 
53.13%±1.50% 



0.15%±0.05% 
0.43%±0.12% 
1.00%±0.15% 
1.59%±0.21% 



1.21±0.28 
1.00±0.24 
0.83±0.23 
0.59±0.23 



Ensemble of networks 



Members 
2 0.57%±0.20% 
5 1.33%±0.27% 
10 1.32%±0.33% 
50 1.35%±0.37% 



62.18%±4.02% 
63.60%±3.56% 
69.35%±3.01% 
63.68%±2.15% 



0.58%±0.10% 
0.95%±0.14% 
1.22%±0.19% 
1.25%±0.14% 



1.15±0.22 
1.27±0.23 
1.38±0.23 
1.11±0.22 



BAH mean return:1.18 ± 0.87% 
BAH mean MDD: 6.59 ± 0.55% 
BAH mean ASR: 0.78 ± 0.3 



Tabic 7: Performance of the ANN model with a single network constructed with various 
hidden layer sizes; performance of the ANN model with various ensemble sizes 
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segment). For simplicity of implementation, in our statistics we ignored such 
trades, after observing that the results with or without them are very close. 

The second problematic situation is when no signals were generated at 
all during a test segment. In such instances the return of the strategy and 
the maximum drawdown are zero, while the Sharpe ratio is not well defined, 
because the standard deviation of the cumulative return (i.e., risk) is zero 
as well. Hence, we omitted such instances when calculating and averaging 
Sharpe ratios. The same consideration and treatment was applied to the suc- 
cess rate metric. Here, when no trades are performed during a test segment, 
the success rate is 100% but is vacuous. Therefore, to be fair, we discarded 
these periods as well when averaging the success rate. An alternative treat- 
ment in the case of the Sharpe ratio could be to define the Sharpe ratio of 
such periods as zero and include it in the statistics. Similarly, we could define 
the success rate of empty segments as zero as well. However, this treatment 
does not make much sense] 10 ! 




To quantify the extent of test periods with undefined Sharpe ratios, con- 
sider two settings of SVR prediction defined with impact turning points of 
resolutions 7 = 0.01 and 7 = 0.02 (see Table |B~1~2]) . For 0.01-impact turning 
points there are 23 test segments without trades (8% of all segments). The 
average BAH return and Sharpe ratio over these segments only are 0.9% and 
1.19, respectively. If we discard these segments when calculating the SVR 
mean Sharpe ratio we obtain 1.34. However, if we account these undefined 
Sharpe ratio as zeros, the result is 0.8. The BAH Sharpe ratio for all test seg- 
ments is 1.01. When considering the setting with 0.02-impact turning points, 
there are 67 segments (22%) without trades (we have more such segments 
in this setting because turning points occur less frequently so there are less 
trades). The average BAH return and Sharpe ratio for these empty segments 
are 1.2%, and 1.12, respectively. When we substitute zero for these undefined 
Sharpe ratios the resulting mean Sharpe ratio for SVR in all periods is 1.02, 
as opposed to 1.67 when ignoring these empty segments. 

For ANN prediction with impact turning points the analogous results 
follows. For 7 = 0.01, there are (on average) 26.4 test segments 
without any trades. The average BAH return and Sharpe ratio over these 



10 If we do count the Sharpe ratio of such test periods as zero, the resulting average 
Sharpe ratio of SVR and ANN do not perform significantly better than the average BAH 
Sharpe. 
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segments are 2.68% and 1.48, respectively. If we discard these segments, the 
Sharpe ratio obtained is 1.14. However, if we consider these Sharpe ratios 
as zeros, the result is 0.6. When considering the setting with 0.02 impact 
turning points, there are 51.4 segments without trades on average. For these 
segments the average BAH return is 2.32%, the average Sharpe ratio is 1.36. 
When takeing these Sharpe ratios as zeros, the resulting ratio is 0.67. When 
ignoring these segments the average Sharpe ration is 1.13. 

10. Concluding remarks 

Drawing on, and extending useful ideas of Li et al. (2} , we proposed and 
studied a prediction model for turning points that relies on support vector 
regression (SVR). Extensive empirical examination of the proposed model 
showed that it outperforms the Li et al. neural network model for the same 
prediction problem. This advantage is statistically significant. While our 
SVR model achieves higher average return than the buy-and-hold benchmark, 
the difference between the two is not statistically significant. However, the 
SVR model suffers from significantly lower drawdowns and exhibits higher 
risk adjust return (Sharpe ratio) than the buy-and-hold. This significantly 
lesser risk may give rise to substantially larger return as well using leverage. 

Our studies included a complete reproduction and a thorough analysis 
of the ANN prediction model of Li et al. j2j. This model is considered the 
state-of-the-art in turning point prediction. Our tests included an empirical 
valuation of that model over multiple periods (as opposed to a single period in 
the original paper) Our conclusions differ than those of Li et al. in regards 
to the performance of their model, and in particular, are less optimistic than 
their conclusions. Nevertheless, their ANN model contains interesting and 
useful ideas that have been utilized here and laid the foundations of the 
present work. 

Our work can be extended and modified in various interesting ways. First, 
it would be interesting to see if better representation can be constructed 
that utilizes other price sequences and economical indicators. Such inter- 
market models are considered be more powerful than autoregressive models. 
It would also be very interesting to examine lower time-frames such as hourly 
prices. While intraday data is considered to be more noisy, it contain more 



11 More precisely, the Li et al. empirical evaluation was confined to two price sequences 
(DJIA and TESCO) and for each sequence a single test period was considered. 
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fluctuations that could be identified by our model. Finally, it would be 
interesting to include rejection mechanisms in the spirit of Chow (see, e.g., 



25l . |26| that can increase prediction accuracy by avoiding prediction in cases 



of uncertainty. 
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Appendix A. An algorithm for extracting alternating pivots 

We are given a time series and would like to extract an alternating se- 
quence of peaks and troughs that satisfy a given property. Throughout the 
description of the algorithm we only consider pivots that satisfy the given 
property. The first phase of this procedure extracts an alternating sequence 
of peaks/troughs. The second phase improves the outcome by considering 
better (higher or lower) alternatives for each pivot. 

Extract initial alternating sequence of peaks and troughs. 

1. Identify the first pivot in the sequence and assume w.l.o.g. that it is a 
trough T\. 

2. Find the subsequent peak Pi. 

3. Discard all troughs appearing between the trough T\ and the peak P\. 
Denote by D the set of all discarded peaks and troughs. 

4. Let T 2 be the subsequent trough. Discard all peaks between Pi and T 2 . 

5. Repeat these steps steps until there are no more peaks and troughs. 

Peaks/Troughs improvement. Consider the alternating sequence of peaks 
and troughs computing as described above. For each pair of subsequent 
troughs Tj and Tj there exists a unique peak P k appearing between the Tj 
and Tj (due to alternation). Replace P k by Py if < Py (i.e. Py is a higher 
peak) and Py G D (i.e., it was discarded in step (3) of the first phase). Apply 
the analogous procedure to improve troughs. 



39 



Appendix B. Additional experimental details 

Here we provide complimentary details for the experiments described in 
sections I9~7TT1 I5T21 and I5T51 

Experiment 1: SVR vs. ANN vs. BAH.. In Experiment 1 (Section 19.7.11) 
we compared the performance of the SVR model to the performance of the 
ANN model. In addition, we compared each of these algorithms to the BAH 
benchmark. In Table IB. 81 the p- values for the Wilcoxon signed rank test are 
presented for each of the three comparisons (ANN vs. SVR, BAH vs. ANN, 
and BAH vs. SVR), and for each of the performance metrics (Mean Return, 
Mean MDD, Mean ASR). Whenever the p- value obtained in the test is less 
than 0.05, we can conclude that the performance of two compared algorithms 
is significantly different (such p- values appear in boldface font), otherwise 
we conclude that the performance is statistically indistinguishable at 95% 
confidence level. 

Experiment 2: SVR Backward window size.. In Experiment 2(Section 19.7.21) 
we examined the performance of the SVR prediction model as a function 
of the backward window length. This examination was repeated for differ- 
ent pivot point types. Statistical analysis was performed separately for each 
pivot point type. Two statistical tests were conducted: the Wilcoxon signed- 
rank test to compare the performance of SVR and BAH (each one of the 
performance metrics was compared to the corresponding BAH metric), and 
the Friedman test to compare the SVR performance for each pivot point 
type. The Friedman test has two stages. In the first stage, the null hypoth- 
esis is that "all algorithms have the same performance and the results differ 
due to a chance." If the null hypothesis is not rejected at 95% confidence 
level, we conclude that all the algorithms in the group are statistically in- 
distinguishable. Otherwise, if the hypothesis is rejected, a post-hoc test is 
performed that divides the group into two or more subgroups such that in 
each subgroup the performance is indistinguishable at 95% level. In case that 
such a subgroup exist, we denote the best subgroup members with boldface 
in Table MM 
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pValues Return MDD ASR 



ANN vs SVR 



0.1% 


0.6684 


0.03 


0.4798 


1% 


0.0101 


0.019 


0.035 


2% 


0.029 


0.013 


0.044 


5% 


0.522 


0.162 


0.0464 


10% 


0.0192 


0.21 


0.51 


BAH vs ANN 








0.1% 


0.595 


2.27E-15 


0.5073 


1% 


0.947783 


4.25E-16 


0.055 


2% 


0.865876 


8.00E-17 


0.0135 


5% 


0.909438 


7.55E-17 


0.0954 


10% 


0.722563 


3.17E-16 


0.8875 


BAH vs SVR 








0.1% 


0.375 


2.09E-9 


0.8899 


1% 


0.564861 


3.30E-17 


0.0028 


2% 


0.7482 


5.31E-17 


2.20E-05 


5% 


0.501418 


7.98E-17 


0.0918 


10% 


0.0277 


4.98E-21 


0.265 



Table B.8: Experiment 1: p-values of the Wilcoxon signed-rank test comparing pairwise 
performance of ANN, SVR and the BAH benchmark. 
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Wilcoxon Signed-Rank Test Friedman ranks 



Backward window 


Return 


MDD 


ASR 


Return 


MDD 


ASR 


Impact 


4 


0.5348 


2.15E-48 


0.0164 


2.09 


2.2 


2.05 


8 


0.8814 


9.65E-43 


0.0001 


2.04 


2.07 


2.09 


50 


0.3776 


3.67E-38 


0.128 


1.87 


1.73 


1.86 


Momentum 














4 


0.469 


2.12E-40 


0.02 


2.09 


2.27 


2.14 


8 


0.0855 


2.13E-34 


0.0118 


1.95 


2.04 


1.98 


50 


0.0723 


1.12E-22 


0.1717 


1.96 


1.69 


1.88 


Pivot 














4 


0.3921 


7.84E-39 


0.0289 


2.05 


2.24 


2.09 


8 


0.0672 


5.73E-31 


0.0046 


2.02 


2.03 


2.01 


50 


0.0513 


2.30E-21 


0.6463 


1.93 


1.74 


1.9 



Table B.9: Experiment 2: Performance dependence on backward window length, p-valucs 
of the BAH comparison, and the Friedman rank test. The values representing statistical 
significance p < 0.05 for the Wilcoxon and Friedman post-hoc tests are marked with 
boldface. 
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Experiment 3: SVR training segment length. Experiment 3(Section I9.7.3P 
results presentation is similar to presentation of the results for Experiment 2 
in Table IB~9l The results of Wilcoxon test and Friedman test are summarized 
in Table lB~T0l 



Wilcoxon Signed-Rank Test Friedman ranks 



Train set length 


Return 


MDD 


ASR 


Return 


MDD 


ASR 


Impact 


0.5 year 


0.9101 


6.33E-44 


0.0077 


2.62 


2.51 


2.53 


1 year 


0.7206 


1.37E-46 


0.0045 


2.46 


2.49 


2.43 


2 years 


0.8814 


9.65E-43 


0.0001 


2.36 


2.54 


2.44 


5 years 


0.2713 


8.13E-44 


0.0001 


2.56 


2.46 


2.6 


Momentum 














0.5 year 


0.2074 


5.02E-33 


0.0488 


2.45 


2.63 


2.4 


1 year 


0.1389 


2.66E-35 


0.004 


2.52 


2.42 


2.55 


2 years 


0.0855 


2.13E-34 


0.0118 


2.4 


2.55 


2.4 


5 years 


0.5036 


2.48E-33 


0.003 


2.63 


2.4 


2.64 


Pivot 














0.5 year 


0.1771 


2.94E-29 


0.009 


2.53 


2.49 


2.42 


1 year 


0.2106 


2.78E-31 


0.002 


2.59 


2.53 


2.59 


2 years 


0.0672 


5.73E-31 


0.0046 


2.44 


2.51 


2.50 


5 years 


0.0529 


7.57E-27 


0.0557 


2.44 


2.47 


2.49 



Table B. 10: Experiment 3: Performance dependence on training segment length. Left part 
contains p-values for BAH comparison, right part contains the Friedman ranks. Whenever 
statistical significant difference is achieved, the results are marked with boldface. 
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Experiment 5: ANN ensemble size. Table Appendix B contains the data 
segments along with their chaotic properties that were used in the experiment 
described in Section 19.7.51 



Training start 


Test start 


EmbcddingDimcnsion 


Delay 


LLE 


Look-ahead 


23-Sep-1958 


29-Feb-1960 


7 


3 


0.17 


6 


Ol-Sep-1960 


08-Feb-1962 





3 


0.16 


6 


25-Jul-1961 


28-Dec-1962 


7 


5 


0.19 


5 


22-Jul-1963 


24-Dec-1964 


(i 


5 


0.15 


7 


26-Jul-1963 


31-Dec-1964 


(i 


3 


0.14 


7 


20-Aug-1964 


25-Jan-1966 


5 


2 


0.19 


5 


31-Jan-1966 


06-Jul-1967 


6 


3 


0.18 


6 


03-Jul-1967 


15-Jan-1969 


5 


3 


0.22 


5 


13-Dec-1967 


30-Jun-1969 


6 


2 


0.19 


5 


20-Feb-1970 


26-Jul-1971 


7 


2 


0.16 


(i 


13-,Tan-1971 


14-Jun-1972 


5 


4 


0.21 


5 


15-Jan-1971 


16-Jun-1972 


5 


4 


0.21 


5 


18-Mar-1971 


17-Aug-1972 


6 


4 


0.21 


5 


23-Aug-1972 


30-Jan-1974 


6 


4 


0.21 


5 


22-Nov-1972 


Ol-May-1974 


7 


3 


0.2 


5 


12-Apr-1973 


16-Sep-1974 


6 


5 


0.21 


5 


16-May-1974 


17-Oct-1975 


7 


2 


0.18 


(i 


22-Jul-1974 


22-Dec-1975 


7 


2 


0.19 


5 


10-Feb-1975 


14-Jul-1976 


7 


3 


0.18 


(i 


04-Mar-1975 


04- Aug- 1976 


5 


3 


0.17 


6 


ll-Nov-1975 


15-Apr-1977 


5 


2 


0.18 


(i 


27-Dec-1976 


31-May-1978 


5 


6 


0.17 


6 


27-Mar-1978 


27-Aug-1979 


5 


2 


0.22 


5 


24-Apr-1978 


25-Sep-1979 


5 


2 


0.22 


5 


05-Feb-1979 


09-Jul-1980 


5 


2 


0.2 


5 


28-Sep-1979 


04-Mar-1981 


7 


3 


0.18 


6 


19-Feb-1982 


22-Jul-1983 


6 


3 


0.16 


(i 


31-Aug-1982 


Ol-Feb-1984 


6 


2 


0.16 


6 


02-Sep-1983 


05-Feb-1985 


5 


2 


0.2 


5 


12-Nov-1984 


18-Apr-1986 


6 


3 


0.16 


6 


19-Dec-1986 


24-May-1988 


6 


1 


0.16 


6 


20-Jan-1988 


22-Jun-1989 


6 


4 


0.17 


6 


27- Feb- 1989 


31-Jul-1990 


5 


2 


0.19 


5 


22-Dec-1989 


29-May-1991 


6 


3 


0.19 


5 


27-Aug-1990 


29-Jan-1992 


5 


1 


0.16 


6 


30-Dec-1991 


02-Jun-1993 


7 


3 


0.18 


(i 


05-Aug-1993 


09-Jan-1995 


5 


4 


0.17 


6 


06-Sop-1994 


07-Feb-1996 


5 


3 


0.15 


7 


ll-May-1995 


ll-Oct-1996 


6 


2 


0.14 


7 


05-May-1997 


07-Oct-1998 


5 


3 


0.16 


(i 


13-Oct-1998 


17-Mar-2000 


7 


4 


0.18 


(i 


28-Jun-2000 


06-Dec-2001 


6 


3 


0.21 


5 


Ol-Apr-2002 


03-Sep-2003 


6 


1 


0.17 


6 


23-Aug-2002 


29-Jan-2004 


5 


4 


0.16 


(i 


29-Apr-2003 


Ol-Oct-2004 


7 


5 


0.14 


7 


22-Apr-2004 


26-Sep-2005 


7 


2 


0.19 


5 


24-Jun-2004 


25-Nov-2005 


7 


1 


0.19 


5 


25-Oct-2004 


30-Mar-2006 


7 


1 


0.18 


6 


25-May-2005 


27-Oct-2006 


7 


2 


0.17 


6 


Ol-Sep-2005 


08-Feb-2007 


6 


2 


0.18 


6 



Table B.ll: Details of the data segments used for ANN method 



44 



7 Mean Return 



Success Rate 



Mean MDD Mean ASR 



Turning points of impact 7 

1% 1.55% ±0.26% 69.43% ±1.97% 1.60%±0.17% 1.38±0.12 

2% 1.54% ±0.25% 74.03% ±1.99% 1.65%±0.18% 1.62±0.13 

5% 0.50% ±0.21% 70.06% ±2.44% 1.26%±0.20% 1.25±0.14 

10% 0.81% ±0.18% 71.22% ±2.41% 0.61%±0.11% 1.61±0.13 



Momentum points of impact 7 

1% 1.46% ±0.28% 67.81% ±2.01% 1.65%±0.18% 1.26±0.13 

2% 1.43% ±0.26% 70.09% ±2.21% 1.57%±0.18% 1.57±0.13 

5% 0.82% ±0.15% 75.47% ±2.20% 0.65%±0.11% 1.63±0.13 

10% 0.23% ±0.06% 77.59% ±2.35% 0.23%±0.07% 1.44±0.09 

BAH mean rcturn:1.44 ± 0.44% 
BAH mean MDD:8.2 ± 0.41% 
BAH mean ASR:1.01 ± 0.15% 



Table B.12: Performance comparison between the SVR and ANN models for turning points 
with impact and momentum characteristics. Whenever a result of one of the methods is 
better at 5% level, according to Wilcoxon signed-rank test, we mark it with (*). Boldface 
numbers mark results that exceed the corresponding BAH metric at 5% level. 
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This figure "diagram5-peakTrough.JPG" is available in "JPG" format from: 



http://arxiv.org/ps/1209.0127v2 



